Python For Data Science Cheat Sheet 


Seaborn 
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Statistical Data Visualization With Seaborn 


The Python visualization library Seaborn is based on 


matplotlib and provides a high-level interface for drawing 


attractive statistical graphics. 


Make use of the following aliases to import the libraries: 


>>> import matplotlib.pyplot as plt 
>>> import seaborn as sns 


3) Plotting With Seaborn 


>>> g = sns.FacetGrid(titanic, 
col="survived", 
row="sex") 
>>> g = g.map(plt.hist,"age") 
>>> sns.factorplot(x="pclass", 
y="survived", 
hue="sex", 
data=titanic) 
>>> sns.lmplot(x="sepal width", 
y="sepal_length", 
hue="species", 
data=iris) 


Subplot grid for plotting conditional 
relationships 


Draw a categorical plot onto a 
Facetgrid 


Plot data and regression model fits 
across a FacetGrid 


>>> h = sns.PairGrid(iris) 
>>> h = h.map(plt.scatter) 
>>> sns.pairplot (iris) 
>>> i = sns.JointGrid(x="x", 
y="y", 
data=data) 
>>> i = i.plot(sns.regplot, 
sns.distplot) 
>>> sns.jointplot("sepal length", 
"sepal width", 
data=iris, 


kind='kde') 


Subplot grid for plotting pairwise 
relationships 

Plot pairwise bivariate distributions 
Grid for bivariate plot with marginal 
univariate plots 


Plot bivariate distribution 


Categorical Plots 


The basic steps to creating plots with Seaborn are: 


1. Prepare some data 

2. Control figure aesthetics 

3. Plot with Seaborn 

4. Further customize your plot 


Scatterplot 
>>> sns.stripplot(x="species", 
y="petal_length", 
data=iris) 
>>> sns.swarmplot (x="species", 
y="petal_length", 
data=iris) 
Bar Chart 
>>> sns.barplot(x="sex", 
y="survived", 
hue="class", 
data=titanic) 


Scatterplot with one 
categorical variable 


Categorical scatterplot with 
non-overlapping points 


Show point estimates and 
confidence intervals with 
scatterplot glyphs 


>>> import matplotlib.pyplot as plt Count Plot 
>>> import seaborn as sns i 
j : >>> sns.countplot (x="deck", Show count of observations 
2 " " Step 1 
oe oe 
sns.set_style("whitegri =n " 
= alette="Greens_d 
>>> g = sns.lmplot (x="tip", P Steps) E P a) 
y="total bill", . f 
data=tip5, >>> sns.pointplot (x="class", Show point estimates and 
aspect=2) ="survived", confidence intervals as 
>>> g = (g.set_axis labels("Tip", "Total bill(USD)"). hue="sex", rectangular bars 
set (xlim=(0,10),ylim=(0,100))) data=titanic, 
>>> plt.title ("title") ana ee ei oe 
>>> plt.show pecs ee er 
p 9 <2 DR ay 
linestyles=["-","--"]) 
Boxplot 
Data Also see Lists, NumPy & Pandas >>> sns.boxplot(x="alive", Boxplot 
="age", 


>>> import pandas as pd 
>>> import numpy as np 


>>> uniform data = np.random. rand (10, 
>>> data = pd.DataFrame ({'x':np.arange(1,101 
'y':np.random.norma 


12) 


dy 
1(0,4,100) }) 


Seaborn also offers built-in data sets: 


>>> iris = sns.load_ dataset ("iris") 


>>> titanic = sns.load_ dataset ("titanic") 


hue="adult_male", 
data=titanic) 
>>> sns.boxplot (data=iris, orient="h") 
Violinplot 
>>> sns.violinplot (x="age", 
y="sex", 
hue="survived", 
data=titanic) 


Boxplot with wide-form data 


Violin plot 


@ Figure Aesthetics Also see Matplotlib 


Context Functions 


>>> f, ax = plt.subplots (figsize=(5, 6) 


) 


Create a figure and one subplot 


>>> sns.set_context ("talk") 


Seaborn styles 


>>> sns.set () 

>>> sns.set_style("whitegrid") 

>>> sns.set_style("ticks", 

{"xtick.major.size":8, 
"ytick.major.size":8}) 


>>> sns.axes style("whitegrid") 


Re)set the seaborn default 
Set the matplotlib parameters 
Set the matplotlib parameters 


>>> sns.set_context ("notebook", 


font_scale=1.5, 
rce={"lines.linewidth":2.5}) | override param mapping 


Set context to "talk" 
Set context to "notebook", 
scale font elements and 


Color Palette 


>>> sns.set_palette("husl", 3) 
>>> sns.color_ palette ("husl") 
>>> flatui = 


Define the color palette 
Use with with to temporarily set palette 


["#9b59b6", "#3498db", "#95a5a6", "#e74c3c", "#34495e", "#2ecc71"] 


Return a dict of params or use with 
with to temporarily set the style 


>>> sns.set_ palette (flatui) Set your own color palette 


Regression Plots 


>>> sns.regplot(x="sepal width", 
="sepal_ length", 
data=iris, 
ax=ax) 


Plot data and a linear regression 
model fit 


Distribution Plots 


>>> plot = sns.distplot(data.y, 
kde=False, 
color="b") 


Plot univariate distribution 


Matrix Plots 


>>> sns.heatmap(uniform_data,vmin=0, vmax=1) 


Heatmap 


O Further Customizations Also see Matplotlib 
Axisgrid Objects 


is 
>>> g.despine (left=True) 
>>> g.set_ylabels ("Survived") 
g 
g 


"Sex") 

>>> h.set(xlim=(0,5), 
ylim=(0,5), 
xticks=[0,2.5,5], 
yticks=[0,2.5,5]) 


-set_xticklabels (rotation=45) 
-set_axis labels("Survived", 


Remove left spine 

Set the labels of the y-axis 
Set the tick labels for x 
Set the axis labels 


Set the limit and ticks of the 
x-and y-axis 


>>> plt.title("A Title") 

>>> plt.ylabel ("Survived") 
>>> plt.xlabel ("Sex") 

>>> plt.ylim(0,100) 

>>> plt.xlim(0,10) 

>>> plt.setp (ax, yticks=[0,5]) 
>>> plt.tight_ layout () 


Add plot title 

Adjust the label of the y-axis 
Adjust the label of the x-axis 
Adjust the limits of the y-axis 
Adjust the limits of the x-axis 
Adjust a plot property 

Adjust subplot params 


5) Show or Save Plot Also see Matplotlib 


>>> plt.show() 

>>> plt.savefig("foo.png") 

>>> plt.savefig("foo.png", 
transparent=True) 


Close & Clear 


Show the plot 
Save the plot as a figure 
Save transparent figure 


Also see Matplotlib 


>>> plt.cla() 
>>> plt.clf() 
>>> plt.close() 


Clear an axis 
Clear an entire figure 
Close a window 
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